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Abstract 

We show that any L\ embedding of the transportation cost (a.k.a. Earthmover) metric on probability 
measures supported on the grid {0, 1, ... , n} 2 c R 2 incurs distortion Q. ( sjlognj. We also use Fourier 
analytic techniques to construct a simple L\ embedding of this space which has distortion 0(\ogn). 

1 Introduction 

For a finite metric space (X, dx) we denote by &x the space of all probability measures on X. The trans- 
portation cost distance (also known as the Earthmover distance in the computer vision/graphics literature) 
between two probability measures fi, v e & x is defined by 

t(ji, v) = min I ^ d x (x, y)n(x, y) : Vx, y e X, n(x, y) > 0, ^ n(x, z) = fi(x), ^ n(z, y) = v(y) I . 

[x,yeX z.eX z.eX j 

Observe that if ju and v are the uniform probablity distribution over &-point subsets A c X and B c X, 
respectively, then 



r(ju, v) = min \ - V* dx(a,f{a)) : f : A — > B is a bijection 



(i) 



This quantity is also known as the minimum weight matching between A and B, corresponding to the weight 
function dx(-,-) (see [42 1). Thus, the Earthmover distance is a natural measure of similarity between im- 
ages I42II15IIT41 - the distance is the optimal way to match various features, where the cost of such a matching 
corresponds to the sum of the distances between the features that were matched. Indeed, such metrics oc- 
cur in various contexts in computer science: Apart from being a popular distance measure in graphics and 
vision B2l[T5llT4ll26l . they are used as LP relaxations for classification problems such as 0-extension and 
metric labelling [9]|8]|2l. Transportation cost metrics are also prevalent in several areas of analysis and PDE 
(see the book [53 1 and the references therein). 

Following extensive work on nearest neighbor search and data stream computations for L\ metrics 
(see 13241 l20l IT9l ITOl l22l ). it became of great interest to obtain low distortion embeddings of useful metrics 
into L\ (here, and in what follows, L\ denotes the space of all Lebesgue measurable functions / : [0, 1] — > R, 
such that ll/H i := J Q \f{t)\dt < oo). Indeed, such embeddings can be used to construct approximate nearest 
'Supported in part by the Israel Science Foundation. 
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neighbor databases, with an approximation guarantee depending on the distortion of the embedding (we are 
emphasizing here only one aspect of the algorithmic applications of low distortion embeddings into L\- they 
are also crucial for the study of various cut problems in graphs, and we refer the reader to [36j|23]|^ for a 
discussion of these issues). 

In the context of the Earthmover distance, nearest neighbor search (a.k.a. similarity search in the vision 
literature) is of particular importance. It was therefore asked (see, e.g. 1351 ) whether the Earthmover distance 
embeds into L\ with constant distortion (the best known upper bounds on the L\ distortion were obtained 
in [ 8 26 ], and will be discussed further below). In [ 30 1 the case of the Hamming cube was settled negatively: 
It is shown there that any embedding of the Earthmover distance on {0, \\ d (equipped with the L\ metric) 
incurs distortion Qid). However, the most interesting case is that of the Earthmover distance on R 2 , as this 
corresponds to a natural similarity measure between images lITH (indeed, the case of the L\ embeddability 
of planar Earthmover distance was explicitly asked in [35 1). Here we settle this problem negatively by 
obtaining the first super-constant lower bound on the L\ distortion of the planar Earthmover distance. To 
state it we first recall some definitions. 

Given two metric spaces (X, dx) and (F, dy), and a mapping / : X — > Y, we denote its Lipschitz constant 

by 

..... d Y (f(x),f(y)) 
1 1/| bp := sup — — — . 

x,yeX d X {x,y) 
x+y 

If / is one to one then its distortion is defined as 

a- „t\ \\f\\ \\f-\w d Y (f(x),f(y)) d x (x,y) 
dist(/) := \\f\\ Lip ■ 11/ Mbp = sup — — sup ■ 

x,yex d x (x,y) x ,yexdy(f(x),f(y)) 

x+y x+y 

The smallest distortion with which X can be embedded into Y is denoted cy{X), i.e., 

cy{X) := inf (dist(/) : / : X <—> Y is one to one) . 

When Y = L p we use the shorter notation cy(X) = c p (X). Thus, the parameter C2(X) is the Euclidean 
distortion of X and c\{X) is the L\ distortion of X. 

Our main result bounds from below the L\ distortion of the space of probability measures on the n by n 
grid, equipped with the transportation cost distance. 

Theorem 1.1. c\ (^{o,i,...,«p> T ) = & ( -v/log nj. 

After reducing the problem to a functional analytic question, our proof of Theorem II .11 is a discretiza- 
tion of a theorem of Kislyakov from 1975 l32l . We attempted to make the presentation self contained by 
presenting here appropriate versions of the various functional anlaytic lemmas that are used in the proof. 

For readers who are more interested in the minimum cost matching metric Q, we also prove the follow- 
ing lower bound: 

Theorem 1.2 (Discretization). For arbitrarily large integers n there is a family W of disjoint n-point subsets 
of{0, 1....B 3 ) , with \<3f\ < „oaoglogn) ) such that 

any L\ embedding of W , equipped with the minimum 



weight matching metric r, incurs distortion 

QfVlogloglogn) - Q(Vlogloglogm) 
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A metric spaces {X,dx) is said to embed into squared L 2 , or to be of negative type, if the metric space 
(x, V^x) is isometric to a subset of L 2 . Squared L 2 metrics are important in various algorithmic applications 
since it is possible to efficiently solve certain optimization problems on them using semidefinite program- 
ming (see the discussion in l3l l3*Tl ). It turns out that planar Earthmover does not embed into any squared L 2 
metric: 

Theorem 1.3 (Nonembeddability into squared L 2 ). lim^^oo c 2 (^{o,...,*!) 2 * V*") - 00 • 



Motivated by the proof of Theorem ll.il we also construct simple low-distortion embeddings of the space 
(^{0,i,...,«} 2 ' T ) i nto It i s convenient to work with probability measures on the torus Z 2 instead of the grid 
{0, 1, . . . , n} 2 . One easily checks that {0, . . . , n} 2 embeds with constant distortion into Z? (see e.g. Lemma 
6.12 in |3*71D . Every p € can be written in the Fourier basis as 

p- j"( M > v ) e "v, (2) 

where 

>~\ 1m{ait-vhv) \ ^ y 

V(a, b), (u, v) e Z n , e uv (a, b) := e " , and V(M,v)eZ„, p(u,v) := — / p(a,b)e uv (-a,-b). 

(a,b)eZ 2 n 

Observe that for n = 2 k + 1 , k e N, the decomposition © can be computed in time O (n 2 log nj using the 
fast Fourier transform [45 1. Motivated in part by the results of [ 40 1 (see also EJ|^), we define 

2tt/» j 

A ^= 2 j 2^ 6 a j anv j2-K«.v)-e w , (3) 

( H ,v)eZ 2 \{(0,0)} \ e " - 1| +\ e " ~ M 

and 

2jTf"t' . 

^ ~ E 1 2= V 7 ^ j5-?(",v)-e w . (4) 

(«, V )ez 2 \{(o,o)} r " - i| + \ e " - 1 \ 

Theorem 1.4. The mapping p i-> (Ap, Bp) from (=^z 2 ' T ) to ^i (Z^J © Li fz^J is bi-Lipschitz, with distortion 
OQogn). 

The 0(logn) distortion in Theorem 11.41 matches the best known distortion guarantee proved in ll26l lSl. 
But, our embedding has various new features. First of all, it is a linear mapping into a low dimensional L\ 
space, which is based on the computation of the Fourier transform. It is thus very fast to compute, and is 
versatile in the sense that it might behave better on images whose Fourier transform is sparse (we do not 
study this issue here). Thus there is scope to apply the embedding on certain subsets of the frequencies, and 
this might improve the performance in practice. This is an interesting "applied" question which should be 
investigated further (see the "Discussion and open problems" section). 



3 



2 Preliminaries and notation 



For the necessary background on measure theory we refer to the book [46], however, in the setting of the 
present paper, our main results will deal with finitely supported measures, in which case no background and 
measurabilty assumptions are necessary. We also refer to the book l53ll for background on the theory of 
optimal transportation of measures. Let (X, dx) be a metric space. We denote by the space of all Borel 
measures on X with bounded total variation, and by &x £ ^x the set of all Borel probability measures 
on X. We also let jtf^ c „# x be the space of non-negative measures on X with finite total mass, and we 
denote by c ^ x the space of all measures p e Mx with p(X) - 0. Given a measure p e ^x, we can 
decompose it in a unique way as p = p + - p~, where p + ,p~ € are disjointly supported. If p, v e 
have the same total mass, i.e. p(X) = v(X) < oo, then we let HQi, v) be the space of all couplings of p and v, 
i.e. all non-negative Borel measures n on X x X such that for every measurable bounded / : X — > R, 

I f(x)dn(x,y) = I f(x)dp(x), and I f(y)dn(x,y) = I f(y)dv(y). 

JXxX JX JXxX JX 

Observe that in the case of finitely supported measures, this condition translates to the standard formulation, 
in which we require that the marginals of n are p and v, i.e. 

Vx, y e X, 2_j n(x, z) = M*)> and 2j n(y1, y ^ = V<J ^ 

zeX zeX 

The transportation cost distance between p and v, denoted here by r(p, v) = t^^Ou.v) (and also referred 
to in the literature as the Wasserstein 1 distance, Monge-Kantorovich distance, or the Earthmover distance), 
is 



{jx> 



r(p, v) := inf \ d x (x, y) dn(x, y) : n e U(p, v) \ . (5) 
IXxX ) 

For p € p + (X) = p~(X), so we may write ||yu|| T := T(p + ,p~). This is easily seen to be a norm on the 
vector space ^^ } T := {p £ : ||ju|| T < ooj. 

Fix some xq e X, and let Lip (X) = Lip XQ (X) be the linear space of all Lipschitz mappings / : X — > R 
with /(xq) = 0, equipped with the norm || • Hup (i.e. the norm of a function equals its Lipschitz constant). 
Any p € ~^x T can be thought of as a bounded linear functional on Lip (X), given by / i-» J x fdp. The 
famous Kantorovich duality theorem (see Theorem 1.14 in |53|) implies that Lip (X)* - ^f^ T , in the sense 
that every bounded linear functional on Lip (X) is obtained in this way, and for every p e , 

IMIt = \Hu Pq( xy ■= sup [JT/^ = / 6 Lip (X), ||/|| Lip < 1 j . 
(We note that this identity amounts to duality of linear programming.) 



3 Proof of Theorem O 

Fix an integer n > 2 and denote X = {0, 1, . . . ,n - 1} , equipped with the standard Euclidean metric. In 
what follows, for concreteness, Lip := Lip (X) is defined using the base point xo = (0, 0). Also, for ease 
of notation we denote .M = -^xr Ot> serve that Lipo and -M are vector spaces of dimension n — 1, and by 
Kantorovich duality, Lipp - ^# and ^* - Lip . 
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Assume that F : ^x — » ^1 is a bi-Lipschitz embedding, satisfying for all two probability measures 
p., v e &> x , 

tQi,v) < \\F(p) - F(v)h <L-T(fi,v). (6) 

Our goal is to bound L from below. We begin by reducing the problem to the case of linear mappings. 
Recall that given two normed spaces (Z, || • \\z) and (W, || • ||w), the norm of a linear mapping T : Z — > W is 
defined as ||r|| = sup-. 6Z \ (0 | (observe that in this case ||r|| = ||r||Lip). 

Lemma 3.1 (Reduction to a linear embedding of j% into Under the assumption of an existence of 
an embedding F : &x —> L\ satisfying ©, there exists an integer N, and an invertible linear operator 
T : j& — » with \\T\\ < 2L and \\T(p)\\i > \\p\\ T for all p € (the factor 2 can be replaced by I + efor 
every s > 0, but this is irrelevant for us here). 

Proof. By translation we may assume that F maps the uniform measure on X to 0. For p e ^£ denote 
H/illoo := maXjcex \p(x)\. Observe that it is always the case that \\p\\co < \\p\\ T - Indeed, if n e Tl(p + ,p~) then 

f \\x-y\\ 2 dn(x,y)> f dn(x,y) = p\X) = p~(X) > ||/x|U. 
JXxX JXxX 

Let Bjp denote the unit ball of jfa '. Define for p € Bj( a probability measure ipipi) € &(X) by ^{p){x) '.- 
^ 2 ■ It is clear that for every v e ||/i-v|| r = ■\-\\ifr(p.)—ifr(v)\\ T . The mapping /j := n 2 -Fo\p : B # — > Li 
satisfies /i(0) = 0, ||/j||Li P < and ||/j(/i)-/z(v)||i > ||//-v|| r . This implies that there exists a map h : ^# — > L\ 
satisfying the same inequalities. We shall present two arguments establishing this fact: The first is a soft 
non-constructive proof, using the notion of ultraproducts, and the second argument is more elementary, but 
does not preserve the Lipschitz constant. 

Let ^ be a free ultrafilter on N, and denote by (L{)<% the corresponding ultrapower of L\ (see Q6| for 
the necessary background on ultrapowers of Banach spaces. In particular, it is shown there that (L\)a// is 
isometric to an L\(cr) space, for some measure cr). Define for p e h{p) - (j ■ h(p/j))°° =l where we 
set, say, h(v) = for v € ^ \ B,£. Then, by standard arguments, pHup < L and p Hup < 1- Moreover, 
h(^) spans a separable subspace of (L\)%, and thus we may assume without loss of generality that h takes 
values in L\ . 

An alternative proof (for those of us who don't mind losing a constant factor), proceeds as follows. For 
every / e L\ letx(f) : [0, l]xl-> {-1, 0, 1} be the function given by 

( 1 f(s) >0,0<t< f(s), 
X (f)(s, t) = sign(/( J )) • l[ ai/(j)| ](f) = -1 f(s) <0,0<t< -f(s), 

{ otherwise. 

It is straightforward to check that lUrC/l-A'OOIlLiao^xR) = for every f,g e L\ (We note here that the 

space Li([0, l]xR) is isometric to L\.) Define h : Jt — > Li([0, l]xl) by setting h(p) - IH r -;f °^(/VIMIt) 
for p € j# \ {0}, and h(0) = 0. Since for every / e L\,x(f) takes values in {— 1, 0, 1}, we have the following 
pointwise identity for every /i.vei' with \\p\\ T > \\v\\ T : 

\h(p) - ~h(v)\ = \\V\\ T ■ 



y o h I — 1 — v ohi — 
^ 'iMIr 



+ (IMIt - IMIr) • x°h 
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Thus 



j|/^)-/l(v)|| Li([ai]xR) = |M| T 



h\^-\-h[ ' 



IMIr 



+ (IMIr -IMIr) 



(7) 



> |M| T - 

> Hv-ju|| T 
= l|v-ju|| T . 

It also follows from the identity © that 



lr IM| T 
IM| t 



+ 



- v 



+ IMIr - IMIr 



\h{p) - h(v 



Li([0,l]xR) 



< L||v|| T 



*llr IM| T 

< L|M~v|| T + L||v|| T |M 

< 3L||/i-v|| T . 



+m-v\\ T 



1 



1 



IMIt IMIr 



+ L\\fX - V\\ T 



We are now in position to use a Theorem of Ribe [44] (see also [T7|, and Corollary 7.10 in for 



softer proofs), which implies that there is an into linear isomorphism S 



LV satisfying ||5|| < L 



and US -1 1| < 1. Since ^ is finite dimensional, by the principle of local reflexivity [33] (alternatively by 
Kakutani's representation theorem [27 , 34 1), and a simple approximation argument, we get that there exists 
an integer N and an into linear isomorphism T : J% — > €^ satisfying \\T\\ < 2L and ||r _1 || < 1 (the value 
of N is irrelevant for us here, and indeed it is possible to conclude the proof without passing to a finite 
dimensional L\ space, but this slightly simplifies some of the ensuing arguments. For completeness we note 
here that using a theorem of Talagrand [ 50 1 we can ensure that ,/V = 0(n log n)). □ 



From now on let T : ^ — > l N x be the linear operator guaranteed by Lemma l3~Tl Since T is an isomor- 
phism, the adjoint operator T* : — > = Lip is a quotient mapping, i.e. ||r*|| < 2L and the image 
of the unit ball of under T* contains the unit ball of Lip . We now define three more auxiliary linear 
operators. The first is the formal identity Id : Lip — > W, where W is the space of all functions / : X — > R 
with f(0) = 0, equipped with the (discrete Sobolev) norm 

n— 1 n-2 n—l n-2 

ll/llw == 2 Z f {i > j)l + E E m + f{i ' j)l 

(=0 7=0 j=0 1=0 

The second operator is also a formal identity (discrete Sobolev embedding) S : W — > li(X), where the 
Euclidean norm on €%QC) is taken with respect to the counting measure on X. The final operator we will use 
is the Fourier operator & : € Z (X) -> € 2 (X), defined for / : X -> R by 

&(f)(u,v) := ^ E /^'^ sin 

" (fc/)eX 

The following lemma summarizes known estimates on the norms of these operators: 

Lemma 3.2 (Operator norm bounds). The following operator norm bounds hold true: 
.||Id||<2n(n-l). •||5||<1. • \\&\\ < I 



(2nuk\ /2ttvA 
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Proof. The first statement means that for every / : X — > R with /(0) - 0, < 2n(n - l)||/||Lip, which 
is obvious from the definitions. The second assertion is that H/H2 < \\f\\w- This is a discrete version of 
Sobolev's inequality FTP (with non-optimal constant), which can be proved as follows. First of all, since 
/(0) = 0, for every (u, v) € X, 



l/(«,v)| - 



u-l v-1 



2 [/(* + 1, v) - f(k, v)] + £ [/(0, ^ + 1) - /(0, 0] 



n-2 n-2 



< £|/(*+l,v)-/(*,v)| + 2l/(0,£+l)-/(0,O| :=A(v). (8) 

fc=0 f=0 

Analogously, 

n-2 n-2 

|/(u, v)| < 2 |/(«, ^ + 1) - /(«, 01 + J] +1,0)- /(*, 0)| := (9) 
Multiplying © and ©, and summing over X, we see that 

in— 1 \ /n— 1 v , /n-l n-l 



< J] A(v)tf (w) = (V A(v)) • fi(u)] < A(v) + J] 5(h)] < 1 (2||/|| w ) 2 

(«,v)eX H-=0 ' \i=0 ' 'v=0 u=0 ' 



The final assertion follows from the fact that the system of functions y(k, €) i-> sin (^p) • sin (^^)} ( . )6X 
are orthogonal in €i(X) and have norms bounded by n. □ 

We now recall some facts related to absolutely summing operators on Banach spaces (we refer the 
interested reader to l5Tll54l for more information on this topic). Given two Banach spaces Y and Z, the n\ 
norm of an operator A : Y — > Z, denoted tti(A), is defined to be the smallest constant K > such that for 
every m e N and every yi, . . . ,y m £ ^ there exists a norm 1 linear functional y* e Y* satisfying 

m m 

J^WAyjWz^Kj^lfiyj)]. (10) 

7=1 7=1 

This defines an /JeaZ norm in the sense that it is a norm, and for every two operators P : W — > Y and 
Q : Z -> V we have ^i(gAP) < ||g|| • ^i(A) • ||P||. Observe that it is always the case that n x (A) > \\A\\. 

Lemma 3.3. Using the above notation, 7Ti(Id) < 1n(n — 1). Therefore, Lemma [PI implies that 

m(^oS o Id o T*) < AnL. 
Proof Fix /!,... ,f m ; X -» R with /rfO) = • • • = / m (0) = 0. Then 

m n-l n-2 m 

7=1 s=0 f=0 7=1 

< 2n(n - 1) max \ max V t + 1) - //j, 01, max V |//f + 1 , 5) - s)\ \ . 

0<.v<n-l z — ' ()<.«<«- 1 z — ' 

(0<?<n-2 7=1 0<f<n-2 7=1 J 

Assume without loss of generality that the maximum above equals Y!]=\ l/jC^O* f o + 1) _ //(so, ?o)U f° r some 
< so < n - 1 and < to < n — 2. Consider the measure /j. = <5( Jo> ; 0+ i) - <5(s ,«o) £ ^ - Lip^. One checks 
that \\h\\t = 1, and £"' = i [/)(j , to + 1) - /jOo, ?o)l = Zy=i IM//)!. implying the required result. □ 
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The fundamental property of the n\ norm is the Pietsch Factorization Theorem (see [51 ]), a special case 
of which is the following lemma. We present a proof for the sake of completeness. 

Lemma 3.4 (Pietsch factorization). Let Y be a Banach space, and fix a linear operator A : — > Y. Then 
there exists a probability measure o~ on {\, . . . ,N) and a linear operator R : L\ (cr) — » Y such that A = Ro I, 
where I is the formal identity from £^ to Li(o~), and \\R\\ = Tt\(A). 

Proof Recall that A : £* -> Y satisfies for all x\, . . . , x m e £™, 



m m 



V \\Axi\\ < m(A) ■ sup V \x*( Xi )\ = m(A) • max V \xj(k)\, 
11^11=1 

where the last equality follows from the fact that the evaluation functionals x i-» x(k) are the extreme points 
of the unit ball of ^ = U£f '. Consider the two subsets of R N : 

(i m m \N \ 

|g ||Axi|| - m<£) J] |jt*(*)|J : meNand x\,...,x m eC\, 



K 2 = \x e ^ : x(k) > for all 1 < k < n\ . 



and 

Note that and K 2 are disjoint convex cones with K 2 open. It follows from the separation theorem that 
there is a non zero cr £ €^ such that cr(x) < for all x € K\ and <r(x) > for all x e K 2 . The second 
inequality implies that cr is positive; we can then assume, by renormalizing, that it is a probability measure 
on {1, . . . , Af}. The first inequality implies that 



||Ajc|| < m(A) f \x(k)\do- 

J{l,..„N} 



for all x e R N . Define Rx = Ax. 



From now on let R and cr be the operator and probability measure corresponding to A = ^ r o5'oIdor*in 
Lemma l3~4l Thus Ro I = ^ o S oldoT* and \\R\\ < 4nL. Schematically, we have the following commuting 
diagram: 

C — Li Po — W — h{X) 4 UX) 




U{cr) 



We need only one more simple result from classical Banach space theory. This is a special case of a 
more general theorem, but we shall prove here only what is needed to conclude the proof of Theorem ll.il 

Lemma 3.5. Let R : Li(cr) — > i 2 be a linear operator. Fix f : — > [0, oo). Then there is x € £ 2 with 
non-negative coordinates such that 

R({g : R N -» R : Vj, \g(j)\ < f(J)}) Q {y 6 t 2 : Vj, \yj\ < Xj }, 

and\\x\\ 2 <\\R\\-\\f\\ L , ((T) . 
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Proof. R is given by a matrix (Rjj : i = 1, . . . ,N, j e N). In other words, for every j, (Rf)j - Tii = \ RijfiO- 
Observe that using this notation, 



1 

\\R\\ = max I V R' } 

i<i<N\o-(i) 2 ^ ,} 

7=1 



oo s 1/2 

,2 



Fix g € Li(cr) such that for all i e {1, . . . , N}, < /(/). Then for all j, 



\(Rg)j\ < |/? i7 |/(/) := *y. 



;'=1 



Now, 



Wl2 



OO / W 



;=1 V /=1 



1/2 



where we have used dTTb . 



(11) 



/V / oo > 1/2 n / co ,1/2 



We are now in position to conclude the proof of Theorem ll.il 

Proof of Theorem I7~71 For (u, v) € {1, . . . , n} 2 define tp luv : X — > R by 

1 [2nuk\ {Invl 

(Pu, v (k, l) := sin • sin 

u + v \ n j 

Then (p u>v (0) - and one computes that ||^ M ,vl|Lip < ^r- By the fact that T* maps the unit ball of ^ onto the 
unit ball of Lip , it follows that there is M>V e with ||0 M , v ||oo < — and T*<p u ,v - <fu,v Now, the functions 
\I(<Pu,v)\ e L\(o-) are point- wise bounded by the constant — , so by Lemma EOl there exists x e C%{X) of norm 
at most ^\\R\\ < \6nL such that \R(I(<f> UiV ))\ is bounded pointwise by x. But, 



RoI(4> u ^(s,t) = &oS oHor(^ v )(i,t) 



Observe that 



So, 



= 1 y — • 



1nuk\ . [2nvt\ . {Insk 
sin I -sin • sin 



n 



n 



(k,t)eX 

ji ■ ' IK" + V )^".vlli(JT) ( J > f ) = v )' 

(s,f) * (u,v). 



u + v)<p UtV \\l (X) = 2, sin ^— j-sin [—} = J 



(k,[)eX 



sin 



RoI(cf> KV )(s,t) 



(M) = («,v), 
(5, f) + (u, v). 
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But 



n-l , , 

log n 



H6nLf > ||x|| > J x£ v > J] [* o /(^ v)(M , v)] 2 = 1 £ — L- > ^ . 

H,V=1 M,V=1 «,V=1 

where the last bound follows from comparison with the appropriate integrals. The proof of Theorem ll.ll is 
complete. □ 



3.1 Discretization and minimum weight matching 

In this section we deduce Theorem 11.21 from Theorem 11.11 The main tool is the following theorem of 
Bourgain [6|, which gives a quantitative version of Ribe's theorem [44]. 

Theorem 3.6 (Bourgain's quantitative version of Ribe's theorem |6|). There exists a universal constant 
C with the following property. Let Y and Z be Banach spaces, dim(F) = d. Assume that is an e-net in 
the unit hall ofY,f:& — > Z satisfies dist(/) < D, and that log log - > Cd log D. Then there exists an 
invertible linear operator T : Y — > Z satisfying \\T\\ ■ ||r _1 || < C ■ D. 

Proof of Theorem 17721 Observe that for every p e the measure • ® pT) is in Il(p + ,p~). Thus 

IMIt < —^7 f II* - y\\ 2 dp + (x)dp-(y) < V5 • (n - 1) • p + (X) < In ■ |supp(^ + )| • Halloo < 2« 3 |H co . 
J xxx 

On the other hand, as we have seen in the proof of Lemma |3~T1 for every p e \\p\\oo < \\p\\ T - It follows 
from these consideration, and Theorems 11.11 and 13.61 that for every integer N > e gC " loglog " ) the set of 
probability measures <3f £ & x consisting of measures p e & x such that for all x e X, p(x) = k/N for some 
k € {0,...,N}, satisfies c\ {& ' , t) = £l( ^logn). We pass to a family of subsets as follows. Let M be an 
integer which will be determined later. For every p e & we assign a subset S M c {0, . . . , nM} 2 as follows. 
For every (u, v) € X = {0, . . . , n - l} 2 , if p(u, v) = k/N, where k £ {0, . . . , Af}, then 5^, will contain arbitrary k 

distinct points from the set (uM, vM) + jo \ V^v]} 2 . Provided M > 4 V^V, the sets {Sp}^^ thus obtained 

are disjoint N point subsets of {0, ... , nM} 2 , and it is straightforward to check that the minimum weight 
matching metric on {S^^oy is bi-Lipschitz equivalent to (ty , r) with constant distortion. □ 



3.2 Uniform and coarse nonembeddability into Hilbert space 

In this section we prove Theorem 11.31 We shall prove, in fact, that the space -^[o,i]2, T does not embed 
uniformly or coarsely into L2. We first recall the denntions of these important notions (see HJEl an d 
the references therein for background on these concepts). Let (X, dx) and (F, dy) be metric spaces. For 
/ : X -» Y and t > we define 

Q f (t) = sup{d Y (f(x),f(y)); d x (x,y) < t], 

and 

u> f (t) = M{d Y (f(x), f(y)); d N {x,y) > t). 
Clearly and a>f are non-decreasing, and for every x,y e X, 

OJ f (d x (x,y)) < d ¥ (f(x),f(y)) < Q f (d x (x,y)) . 
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With these definitions, / is uniformly continuous if lim ; _>o fit) = 0, and / is said to be a uniform embed- 
ding if / is invertible and both / and / are uniformly continuous. Also, / is said to be a coarse embedding 
if Clfit) < oo for all t > and lim^oo (oAt) - oo. 

In what follows we will use the following standard notation: Given a sequence of Banach spaces 
• llz ; )}._ 1 the Banach space (®°° =l Z/)^ is the space of all sequences z = izj)J =1 £ YlJ = i^j sucn 
that ||z|| := £°° =1 \\zj\\ Zj < oo. If for every j e N, Zj = Z u we write l x {Z x ) = (0°° =1 Zj\. 

Theorem 3.7. The spaces n } 2 t< do not admit a uniform or coarse embedding into L2 with moduli 



n=i 



uniformly bounded in n, i.e., there do not exist increasing functions to, Q : [0, 00) — > [0, 00) which either 

satisfy lim f _»o oj(t) — lim f ^o ^(0 — 0, or lim^o, aj(t) = 00, and mappings f„ : 2 — > L2, such that 

\\j,...,n) 



- vllr) < 11/nO") - fniv)h < WWn ~ v\\ T )for all p,v€ nl 2 and all n. 

Proof. If this is not the case then by passing to a limit along an ultrafilter we easily deduce that 2 ^ uni- 
formly or coarsely embeds in an ultraproduct of Hilbert spaces and thus in L2 (see [ 16 , 17 1). By a theorem of 
Aharoni, Maurey and Mityagin [ 1 ] in the case of uniform embeddings, and a result of Randrianarivony [43 ] 
in the case of coarse embeddings, this implies that 2 is linearly isomorphic to a subspace of Lq. By a 

theorem of Nikisin 13^1 it follows that ^t9 Q 2 is isomorphic to a subspace of L\- E for any e € (0, 1). We 
recall that it is an open problem posed by Kwapien (see the discussion in I28I I4I) whether a Banach space 
which linearly embed into Lq is linearly isomorphic to a subspace of L\. If this were the case, we would 
have finished by Theorem ll.il Since the solution of Kwapien's problem is unknown, we proceed as follows. 
Let {S j}°° =l be a sequence of disjoint squares in [0, l] 2 with 

d{Sj,Si c )= min \\a - b\\ 2 > max IdiamS ,-, diamSu ■ (12) 

aeSj, beS k > 

Consider the linear subspace Y of , consisting of all measures p satisfying supp(//) c M'f 1 S ; and 

|_U, lj" J 1 J 

p(S j) = for all j. It is intuitively clear that in the computation of \\p\\ T for p e Y the best transportation 
leaves each of the S j invariant; i.e., it is enough to take the infimum in (J5} only over measures n € H(p, v) 
which are supported on \J°° =[ (S j XS j). This is proved formally as follows: Fix p € Y and write p - Yj°jL\ Af/« 
where supply) c S • and pj(S f) = for all j e N. We claim that 

00 

INI[0,iP,T = 2 | WISy,T. (13) 

7=1 

If Uj € n(juj,/iT) then n := Y^ =x n i € n 0" + >/O- Thus IN[0,i] 2 ,r ^ Y^=\ IMIs ; ,r- To prove the reverse 
inequality take n e Tl(p + ,p~). For every j = 1,2, .. . define a measure cr j on Sj as follows: For A c S j set 
o~j{A) := 7T^A x [jk+j Sk\ Thus, in particular, by our assumption (IT^t for every y e S j, 



I \\x - yWido- j{x) = \ \\x-y\\ 2 dn(x,z) < I \\x-zhMx,z). (14) 

Writing 



00 ^ 00 



7=1 



^7) jrf tt(S ; x Ufat^jt) 



o-j ® o-j, 
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it follows from our definitions that n € YI(ju + ,ij. ) and n is supported on \J°° =i {S j x S j). Moreover, for each 
j, nj := n\ s . € n(ju|,//J), so that 

CO CO ^ 

EWky.T < X I ||x-y|| 2 ^(x,y) 

f l|x-ylM7r(x,y)+ V — 7 • f ||x - y\\ 2 d(T j{x)dcr j(y) 

m r vi r 

< ||x-y|| 2 rf7r(x,j)+ > ||x-z|| 2 ^(x,z) 



I ||x -.y|| 2 <i7r(x,;y). 



'(U£ , S;)x(U7 =] Sj) 

This concludes the proof of dl3b . It follows that Y is isometric to (@^Lj T ) J , which in turn is isometric 

to £\ jj 2 T j- Now, Kalton proved in f2"""ll that if for some Banach space X, £\{X) is isomorphic to a 
subspace of Lo, then X is isomorphic to a subspace of L\ and we finish by Theorem ll.il □ 

Proof of Theorem [PI Assume for the sake of contradiction that there exists C < oo such that for all n e N, 
c 2 (^{o,...,n} 2 ' Vn < By the proof of Lemma "3~T1 we know that the unit ball of ^jo,..., n } 2 ,r * s isometric to 
a subset of n } 2 > T )- Thus by our assumption there exist mappings /„ : ^# j0 ;i) 2 — > L 2 such that for 

every v e ^ { o,...,„}2 with |H r , ||v|| r < 1, 

Vlly" - Vllr < 11/nO") - /„(V)|| 2 < C • VllA* - V||r ■ (15) 

Let % be a free ultrafilter on N. Define : „p — > (L 2 )^ by = ( V7 " frSP-IJ)) ._j In- 

equalities O imply that all //, v € ^o,...,^ satisfy Vlly" 1 v llr < - /«(v)II(l 2 )^ < C • a/Hj" ~ v llr- 

Since the ultrapower (Jj£)<% is isometric to a Hilbert space (see [16]), we arrive at a contradiction with 
Theorem 13.71 □ 

Remark 3.1. We believe that Theorem fO]can be made quantitative, i.e. one can give explicit quantitative 
estimates on the rate with which c 2 (^'{o > ... >n p, V*j tends to infinity. This would involve obtaining quanti- 
tative versions of the proofs in fTl l2"8l 133*1 . which seems easy but somewhat tedious. We did not attempt to 
obtain such bounds. 

Remark 3.2. We do not know whether (^[ 01 ]2,r) admits a uniform embedding into Hilbert space. The 

proof above actually gives that for all a € (0, 1], (^[o if T , T a ) does not embed bi-Lipschitzly into Hilbert 
space. But, our proof exploits the homogeneity of the function t t a in an essential way, so it does not 
apply to the case of more general moduli. 

4 Upper bounds via Fourier analysis 

In this section we prove Theorem 1 1.41 and discuss some related upper bounds. Given a measure fi on l} n we 
decompose it as in ("3, and we consider the linear operators A and B, from ^# Z 2 to L\ (z^J, defined in ("3 
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and ©, respectively. One checks that the duals of these operators, A*, B* : L\ fejj 
given by 



LiPo(z^), 



are 



e " — 1 



(w,v)eZi\{(0,0)} 



and 



*7 = 2 



(h,v)6Z,1\{(0,0)} 



e » - 1 + e « - 1 



_ 27171' 

e « - 1 

I 2;m/ |2 I 2mv I 

\e " — 1 + e « — 1 



/(w,v) • (<? uv - 1), 



(16) 



■ f(u,v) ■ (e uv - 1). 



(17) 



To check these identities the reader should verify that for all /i € ^# Z 2, J z2 fd(Afi) = f z2 (A*f)dfi, and 

similarly for B (to this end, recall that fi (z%) = 0, so that ^(0, 0) = 0. This explains the subtraction of 1 in 
the identities (fT6l and (flTt). 

We claim that for every /u e , 



^ll^llL 1 (^) + H^llz. 1 (^)^Clogn- 



(18) 



where C is a universal constant. This will imply Theorem 1 1 .41 since the mapping fi — U, where U is the 
uniform probability measure on Z%, is an isometric embedding of .^ s z ~ mt0 ■•^z?- 

By duality, (fT£t is equivalent to the fact that the mapping (f, g) \-* A*f + B*g from (Z^J © fe;) to 
Lip (Z^J is a C log n quotient map, i.e. for every (/, g) e (Z^J © (Z^J 



\\A*f + B*g|| L . p < C log n • max {||/|U, ||g|| 



(19) 



and for every h e Lip (z~\ there is some (/, g) e \ Z~] © L m fejj satisfying A* f + B*g = h and 
maxlll/Hoo, Hgllco) < Pllup- The second assertion is proved as follows: Take / = d\h and g = djh, where for 
j = 1,2, djh{x) = h{x + ej) - h{x) (here e x = (1,0) and e 2 = (0, 1)). Clearly ||/|U, ||g|L < ||/j|| Lip , and 



A7 + fl*g = ^ 

(«,v)eZg\{(0,0)} 



- l) • d\h(u, v) + {e " _ l) • dih{u, v) 



(e uv ~ 1) 



(« 



/ / _ ITTIU 

e » 



(«,v)€Z2\{(0,0)} 

(«,v)eZj\{(0,0)} 
^ /j(m, v)e 

(«,v)€Zj 



\e " — 1 + e » — 1 
l) . (e 2 ? - l) + -!)•(,¥ -1)^1 



|e « — 1 + \e » — 1 
^ fc(w, v) 

(»,v)eZ2\{(0,0)} 



h(u,v)(e uv - 1) 



ft, 



where we used the fact that h(0) = 0. 

It remains to prove dl9b . To this end, it is enough to show that ||A*/||Li P < 0(log n) ■ H/IU and ||B*g||Li P < 
O(logn) • Uglloo. We will establish this for A*- the case of B* is entirely analogous. Observe that 



IIAVIbp < PiA*/|U + \\d 2 A*f\U 
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so it is enough to establish the following two inequalities: 



I IsilL |2 
\e « - 1 



(«,v)€Z^\{(0,0)} 



I 2tt/» |2 I 2;rh' |2 

e « - 1 + \e n — 1 



< 6>(log n) 



(20) 



and 



-!).(*¥ -i) 

2j ~ ,9 , ^ • /(«. V K 



( M ,v)eZ£\{(0,0)} 



I 2m'» |2 I 2njv |2 

e " — 1 + \e " — 1 



< 6>(log n) 



(21) 



Since for > the norms on (z^j and L p (Z^J are equivalent with constant n 2 l p (by Holder's inequality), 
it is enough to show that for p > 2, 



2 



I I7TIU I 

\e — 1 



(«,v)€Z^\{(0,0)} 



I |2 I 27riv |2 

\e " — 1 + e i — 1 



<0(p) 



(22) 



and 



v (e-^-l)-(e^-l) „ 

2j — ■ — ~ ' /( "' v)eu 



(«,v)sZg\{(0,0)} 



\e " — 1 + e « — 1 



<0(p) 



up- 



(23) 



To prove inequalities d22l and d23l we will assume that n is odd (all of our results are valid for even n as 
well, and the proofs in this case require minor modifications). We think of Z\ as [—(n - l)/2, (n - l)/2] n Z. 
As before, given m : Z„ — » C we denote 

9i/n(jc,y) = m(x + — m(x,y), and d2in(x,y) = m(x,y + I) - m(x,y). 



Thus 

d\m(x,y) = m(x + 2,y) — 2m(x + l,y) + m(x, y) and ^/^(x,^) - m(x,y + 2) - 2m(x,y + 1) + m(x,y), 



and 



d\dim(x,y) = d2d\m(x,y) = mix + l,y + 1) - m(x + l,y) - m(x,y + 1) - m(x,y). 



In what follows we think of m as a Fourier multiplier in the sense that it corresponds to a translation 
invariant operator T m on L2 fejj given by 



T m (f) ^ m(w, v) • /(m, v) • e MV , 



(24) 



(M,v)eZ^ 



Recall that an operator T : Li (z£j — > Li (Z^j is said to be weak (1, 1) with constant A" if for every 
/ : Z 2 n -> C and every a > 0, 

(((ii,v)eZ2 : |r/(n,v)|>fl)|<|-|l/lli = f- 2 l/(»,v)|. 

(«,V)€ZJ 



We will use the following discrete version of the Hormander-Mihlin multiplier theorem l3lfl[T%ll . 
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Theorem 4.1 (Hormander-Mihlin multiplier criterion on Z 2 ). For j e N denote Qj = [-V, V]x[-V, V\ 
Fix B > and m : Z 2 , — > C vv/f/z m(0, 0) = 0, arcc? assume that for all j = 0, 1, . . . , |_log 2 (rc - 1)J - 1, 

J] [2- 2j \m(u, v)\ 2 + |5im(«, v)| 2 + \d 2 m(u, v)| 2 + 

(",v)e(ej\ej-i)nz2 

2 27 |5 2 m(M,v)| 2 +2 2i \djm(u,v)\ 2 + 2 2j \d 1 d 2 m(u,v)\ 2 ] < B 2 . 
Then the translation invariant operator T m corresponding to m is weak (1,1) with constant 0(B). 

While the continuous version of the Hormander-Mihlin multiplier theorem is a powerful tool which 
appears in several texts (e.g. in the books [ 12 48 52 1), we could not locate a statement of the above discrete 
version in the literature. It is, however, possible to prove it using several minor modifications of the existing 
proofs. The standard proof of the Hormander-Mihlin criterion is usually split into two parts. The first part, 
which is based on the Calderon-Zygmund decomposition, transfers virtually verbatim to the discrete setting- 
see Theorem 3 in Chapter 1 of [48 ], and Remark 8. 1 there which explains how this part of the proof transfers 
from R" to the setting of finitely generated groups of polynomial growth (in fact, the Calderon-Zygmund 
decomposition itself, as presented in Theorem 2 in Chapter 1 of [48 1, is valid in the setting of general metric 
spaces equipped with a doubling measure). The second part of the proof of the Hormander-Mihlin theorem, 
as presented in Theorem 2.5 of [18], requires several straightforward modifications in order to pass to the 
discrete setting. We leave the simple details to the reader. For the sake of readers that are not familiar with 
these aspects of Fourier analysis, we will later present a complete reduction to a continuous problem whose 
proof appears in print, which yields slightly worse bounds on the distortion guarantee. 

In order to apply Theorem l4.1l we consider the following two multipliers, 

\e~ - 1 [e n - 11 • le n - 1 1 
m\(u,v) := - : -, and ni2(u,v) := - r, (25) 

lv ' ' I 2mu |2 I 2mv |2 v ' I 2mu |2 I 2S1 , |2 

\e '• — 1 + \e >• — 1 \e « — 1 + \e » — 1 

where we set mi(0, 0) = ni2(0, 0) = 0. A direct (albeit tedious!) computation shows that m\ and ni2 satisfy 
the conditions of Theorem l4. ll with B = 0(1). Thus, the operators T m and T m are weak (1,1) with constant 
0(1). Since m\ and ni2 are bounded functions, the operator norms ||r mi \\l 2 (z 2 )->l 2 (z 2 ) an( ^ ll^m 2 llL 2 (z 2 )->L2(z 2 ) 
are 0(1). Since these operators are self adjoint, by the Marcinkiewicz interpolation theorem (see [56 1) it 
follows that for p > 2, the operator norms \\T mi \\ Lp(7 2^ Lp(Z 2^ and II^JI^^)^^^) are O(p). This is 
precisely d2~2T) and d23t . 

The above argument is based on Theorem 14. II which does not appear exactly as stated in the literature, 
but its proof is a straightforward adaptation of existing proofs (which is too simple to justify rewriting the 
lengthy argument here). However, making the necessary changes easily does require some familiarity with 
Calderon-Zygmund theory. We therefore present now another argument which gives a polylog(?i) bound on 
the distortion, but uses only statements which appear in the literature. This alternative approach appears to 
be quite versatile, and might be useful elsewhere. 

The following lemma reduces the problem of proving inequalities such as d2"2T) and d2"3l (with perhaps 
a different dependence on p) to a continuous inequality. The argument is based on the proof of a theorem 
of Marcinkiewicz from [56] (see Theorem 7.5 in chapter X there). In what follows we denote by T the 
Euclidean unit circle in the plane. 
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Proposition 4.2 (Transferring multipliers from the torus to Z 2 ). Fix an odd integer n. Let {A(u, v)}~ =0 
be complex numbers such that A(u, v) - Ofor max{w, v} > n. Consider the operators M : L p (l 2 ) — > L p (t 2 ) 
and M n : L p (z 2 ) — > L p (Z 2 J g/ven fry 



M 



^ /(",v)< 



2ni(ux+vy) 



\u,v=-oo 



2 i( M ,v)/( M ,v) e 2m '(" 



ant/ 



Then, 



( n-\ 



M„ 



L (ua+vb) 



\u,v=0 



u,v=0 



n-1 



— 5 

A(u,v)f(u,v)e~ 



L (ua+vb) 



H,V=0 



H M nllL ;) (Z,2)^L„(z2) ^ 8 1 • II^IIl p (T2)^L ;) (T 2 )- 

Proof. The proof is a variant of the first part of the proof of Theorem 7.5 in chapter X in [56 1, and a small 
twist on the second part. Since the terminology in [56] is different from ours, we repeat the proof of the first 
part as well. Recall that the Dirichlet kernels D e : [0, 1] — > C are defined as, 



D t {x) = ^ e 2 ^ x , 
h-e 



and the Fejer kernels K m : [0, 1] 



are 



K, 



r(x) = 7YD ( X= V 1 



1=0 



j=-m 



m + 1 



Jlnijx 



A basic property of D[ is that for any trigonometric polynomial S (x) of degree at most £, namely S (x) = 
Ylj = -c aje 2m i x , we have that S (x) = S * D({x) = S {t)D({x - i)dt. The same is true with any other function 
all of whose /fh Fourier coefficients for j between —I and I are 1 ; in particular for the de la Vallee Poussin 
kernel 2K 2 c-\ - K[-\ (see [29]). The well known advantage of the Fejer kernel over the Dirichlet kernel is 
that it is everywhere (real and) nonnegative. Note also that J K m {t)dt = 1 for all m. Thus, by convexity of 
the function t p , for any trigonometric polynomial S of degree at most €, and for all x e [0, 1], 



|5 W| 



p - 



\2S *K 2e -i(x)-S *K t {x)\P 



< y 



2 r l 

3 Jo 



\S(t)\ p K 2{ -i(x-t)dt + 



If 1 

3 Jo 



\S(t)\ p K e _i{x-t)dt 



(26) 



Let now <jl>2c+\ be the measure which assign mass j^rr to each of 2^ + 1 equally spaced points on [0, 1]. 
Then it is easy to check that 

J K m (x - t)daj 2 e+i(x) = I K m (x - t)dx - 1 
Jo Jo 

for all m <2l and for all t € [0,1]. Integrating d26l with respect to 0) 2 (+u we get that for any trigonometric 
polynomial S of degree at most i 



f 

Jo 



\S(x)\Pdoj 2 e + dx)<3 p f \S(x)\Pdx 
Jo 



(27) 
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It follows that if S(x,y) is a two-variable trigonometric polynomial of degree at most I in each of the 
variables, i.e. S{x,y) = Yf u v= _ ( a^e™"*^ , 

f \S(x,y)\Pdaj 2 e + i(x)dto 2M (y) <9 P f \S(x,y)\ p dxdy. 

J[0,1] 2 J[0,1] 2 

It follows from this that, since n is odd, for every / e L p (l 2 ), 



( n-l 



Mn 



— 2m 

f(u, v)e » 



(ua+vb) 



M,V=0 



<9 



L p (Z 2 ) 



M 



£ f{u,v)t 



2ni(ux+vy) 



Note that for each trigonometric polynomial of the form P(x,y) - YJl v =- n +i ^uv^ 2m(ux+vy \ 

J P(x,y)do) n (x)da> n (y) - oq - I P(x,y)dxdy. 
J[0,1] 2 J[0,1] 2 

Fix f e L p (Z 2 ), 1 < p < oo. By the first part of the proof and duality, there is g € L p «(T 2 ) (p* - p/(p - 1)) 
with \\g\\ p * = 1 such that 



( n-l 



W M nf\\ Lp (Zl) 



2ni(ux+vy) 



9f ^Ajf(u,v 

r ( n ~ l 

i I V A(u, v)f(u, v)e 

Jimp La 



g{x, y)dxdy 



' n-l 

; [ ( »-']- W=0 

' n-l 

9 } V ,!(//. v)/(//.vk- T 

V«,v=0 V 



/ n-l 



\ 



Vu,v=0 
/ n-l 



-2ni(ux+vy) 



-2ni(ux+vy) 



Vu,v=0 



- 9 



< 9 



f n-l \ 

a 2ni{ux+vy) 



n-l 



n-l 

n-l 

A(u,v)-g(u,v)e-^ i(ux+vy) 



L 



[0,1] 2 



K,V=0 



2/ri(nx+v;y) 



J] ^( M ,v)?( M ,v)e- M(t " + ^ 

\u,v=0 



dxdy 



du> n (x)du> n (y) 



du> n (x)du> n (y) 



du> n (x)du> n (y) 



X 



< 81 



[0,1] 2 



Il„(z„ 2 ) 



M,V=0 



f 

J[0,1] 2 



da> n (x)a> n (y) 



n-l 



^ /l(w,v)g(w,v)e 



2m(ux+vy) 



u,v=0 



^ 81 • II/Hl p (z 2 ) • II m IIl„(t 2 )^l p (t 2 )- 



where the inequality before last follows from tfTTl and the last inequality (that is the fact that the norm of a 
multiplier in L p (l 2 ) is the same as the norm of the conjugate multiplier in L p * (l 2 )) follows from duality. 
The case p = 1 (and also a similar inequality for the oo norm) follows easily from the L p cases. □ 
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Proposition I4.2l implies that it is enough to obtain L p to L p bounds for the operators T m and T m2 , where 
m\,mi are as in d25l . as operators on functions on the torus T 2 . By a theorem of de Leeuw [11| it is 
enough to obtain such bounds when we think of T m and T„ n as operators on functions on R 2 (see [55 1 for 
the respective result in the case of weak (1,1) bounds). The continuous version of the Hormander-Mihlin 
multiplier theorem now applies, but unfortunately its conditions are not satisfied. However, a (once again 
tedious) computation shows it is possible to apply the Marcinkiewicz multiplier theorem (see [47||52)), in 
combination with bounds on the Hilbert transform I4"71l52ll . to obtain bounds similar to d2~2T) and d2~3l with 
O(p) replaced by 0(po\y(p)) (it is quite easy to obtain a bound of 0(p 3 ), and with more work this can be 
reduced to 0(p 2 ). However we do not see a simple way to obtain O(p) using this approach). 

Remark 4.1. Consider the mapping S : — > L\ (z 2 ) given by 

Zi Ixiu I I 2mv \ 

\e » - 1| + \e » - 1 I -fi(u,v)e uv . 

( M ,v)eZ2\{(0,0)} 

Using considerations similar to the above (see Proposition III. A. 3 in ll54l for a continuous counterpart) it is 
possible to show that S has distortion (9(polylog(?i)). However, we were unable to get this bound down to 
0(log n) as in Theorem ll.4l Nevertheless, this embedding might be of interest since it reduces the dimension 
of the ambient L\ space by a factor of 2. 



5 Discussion and open problems 

There are several interesting problems that arise from the results presented in this paper- we shall discuss 
some of them in the list below. 

1. The most natural problem is to determine the asymptotic behavior of c\ ({0, 1 . . . ,n} 2 ,rj. It seems 

hard to use the ideas in Section 0] to obtain an embedding of distortion O^log nj, as the known 
bounds on multipliers usually give a weak (1,1) inequality at best. 

2. Remark |4~T1 implies that the Banach-Mazur distance between the n 2 - 1 dimensional normed space 

t an d 1 is 0(polylog(n)). It would be interesting to determine the asymptotic behavior of 
this distance. In particular, it isn't clear whether the L\ (embedding) distortion of ^# Z 2 T behaves 

differently from its Banach-Mazur distance from 

3. We did not attempt to study the L\ distortion of i,...^v* T f° r d > 3. Observe that this space 
contains ^joj,...^} 2 ^' so tne ^ ( -\A°§ n ) l° wer bound still applies. But, the result of l3*Ull shows that 
the transportation cost metric on the Hamming cube {0, \ } d has distortion &(d), so some improvements 
are still possible. Note that in higher dimensions it becomes interesting to study the transportation cost 
distance when R d is equipped with other norms. The Banach-Mazur distance between £ d and arbitrary 
J-dimensional norms has been studied in 0|49lEl- m particular, the result of [13] states that any 
J-dimensional Banach space is at distance O (d 5 ^ 6 ^j from € d . Combining this fact with the lower bound 
on the L\ distortion of the transportation cost distance on the Hamming (^i) cube cited above, we see 
that for any norm || • || on W 1 , c\ (^(r^u-hxt) = ^ (<i 1/6 )- It would be interesting to study the dependence 
on d for general norms on E. d . 

4. As stated in Remark l3~Tl it would be interesting to study the rate with which C2 (^{o,...,n} 2 > Vm tends 
to infinity. 
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5. As stated in Remark l3~2l we do not know whether [SP^^i , t) admits a uniform embedding into Hilbert 
space. 

6. The present paper rules out the "low distortion approach" to nearest neighbor search in the Earthmover 
metric via embeddings into L\. However, it might still be possible to find nearest neighbor preserving 
embedding s into L\ in the sense of [25 ]. 

7. On the more "applied side", as stated in the introduction, there is a possibility that the embedding of 
Theorem 1 1 .41 behaves better than the theoretical distortion guarantee of O(logra) in "real life" situa- 
tions, since it is often the case that the bulk of the Fourier spectrum is concentrated on a sparse set of 
frequencies. Additionally, it might be worthwhile to "thin out" some frequencies of the given set of 
images before embedding into L\ (and then using the known L\ nearest neighbor search databases). 
It would be interesting to carry out such "tweaking" of our algorithm in a more experimental setting. 
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